Natural language watermarking: Challenges in building a practical system

نویسندگان

  • Mercan Topkara
  • Giuseppe Riccardi
  • Dilek Z. Hakkani-Tür
  • Mikhail J. Atallah
چکیده

This paper gives an overview of the research and implementation challenges we encountered in building an endto-end natural language processing based watermarking system. With natural language watermarking, we mean embedding the watermark into a text document, using the natural language components as the carrier, in such a way that the modifications are imperceptible to the readers and the embedded information is robust against possible attacks. Of particular interest is using the structure of the sentences in natural language text in order to insert the watermark. We evaluated the quality of the watermarked text using an objective evaluation metric, the BLEU score. BLEU scoring is commonly used in the statistical machine translation community. Our current system prototype achieves 0.45 BLEU score on a scale [0,1].

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Survey of Watermarking Techniques for Non-Media Digital Objects (Invited Talk)

The vast majority of the literature on watermarking has dealt with media such as images, video, and audio – all of which are ultimately destined for consumption by the human perceptual system. There has recently been growing interest in watermarking non-media such as relational data, software, natural language text, sensor streams, etc. The challenges posed by these new domains are quite differ...

متن کامل

Data-Driven News Generation for Automated Journalism

Despite increasing amounts of data and ever improving natural language generation techniques, work on automated journalism is still relatively scarce. In this paper, we explore the field and challenges associated with building a journalistic natural language generation system. We present a set of requirements that should guide system design, including transparency, accuracy, modifiability and t...

متن کامل

Bilingual Education and Necessity to Differentiate Two Educational Challenges for Deaf Students

Background: Some obstacles and inefficiencies in deaf education system may be attributed to the fact that the right to education and equality of opportunities for national core curriculum, and the need for learning Farsi language are not met separately among deaf students. In fact, the distinction between these two educational challenges is not addressed to deaf pupils in particular. Based on t...

متن کامل

Practical challenges for digital watermarking applications

The field of digital watermarking has recently seen numerous articles covering novel techniques, theoretical studies, attacks, and analysis. In this paper, we focus on an emerging application to highlight practical challenges for digital watermarking applications. Challenges include design considerations, requirements analysis, choice of watermarking techniques, speed, robustness, and the trade...

متن کامل

Natural language watermarking

In this paper we discuss natural language watermarking, which uses the structure of the sentence constituents in natural language text in order to insert a watermark. This approach is different from techniques, collectively referred to as “text watermarking,” which embed information by modifying the appearance of text elements, such as lines, words, or characters. We provide a survey of the cur...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006